The linear Ising model and its analytic continuation, random walk 
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A generalization of Gauss's principle is used to derive the error laws corresponding to Types II 
and VII distributions in Pearson's classification scheme. Student's r-pdf (Type II) governs the dis- 
tribution of the internal energy of a uniform, linear chain, Ising model, while analytic continuation 
of the uniform exchange energy converts it into a Student t-density (Type VII) for the position of a 
random walk in a single spatial dimension. Higher dimensional spaces, corresponding to larger de- 
grees of freedom and generalizations to multidimensional Student r- and t-densities, are obtained by 
considering independent and identically distributed random variables, having rotationally invariant 
densities, whose entropies are additive and generating functions are multiplicative. 
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A relation between the Ising problem and random 
walks has been conjectured a number of times over the 
years Q. Rather than one of technique, we shall show in 
this letter that the two are inter convertible through ana- 
lytic continuation of the uniform exchange energy which 
transforms a probability density function (pdf) of finite 
range into one of infinite range. Specifically, a Student r- 
pdf 0, of limited range, which governs the distribution 
of the internal energy of a linear, uniform Ising chain, 
is transformed into Student's t-pdf, of unlimited range, 
for the position of a random walker along an infinite lin- 
ear chain, through analytic continuation of the uniform 
exchange energy between spins. These two families of 
distributions will be derived as error laws from a gener- 
alization of Gauss's principle, which will serve in uniting 
these two seeming unrelated phenomena. 

Gauss's principle asserts that if the probability of an 
error, given the true value, is a function of the error 
alone, and the log-likelihood function is maximum when 
the true value coincides with the mean of the observed 
values, then the law of error must be normal How- 
ever, many other error laws exist, such as the beta pdf, in 
which the probability of an error is not a function of the 
error alone 0, pp. 165]. Moreover, other error laws tend 
to the normal law only in the asymptotic limit, where 
the number of observations increases without limit. Stu- 
dent's r- and i-pdfs fall into this category. 

A law of error for determining the probability of devi- 
ations from a location parameter A is given in the form: 

P(dx\X) =p(x- X)dx. (1) 

The method of maximum likelihood switches the roles 
of variate, x, and parameter, A, thereby converting the 
probability Q into a likelihood function. Maximum like- 
lihood is determined from a solution of 
d 



dX 



log£(A) 



where the prime stands for differentiation with respect 
to A. 

If the arithmetic mean, 



- A) = 0, 



(3) 



is the most probable value of the quantity measured, then 
the normal law follows by integration of the log- likelihood 
equation. The log-likelihood equation is formed by set- 
ting deviations from J2J proportional to deviations from 
P)l , where the constant of proportionality is conveniently 
taken as the second derivative of an, at least, twice dif- 
ferentiable continuous function The normal law cor- 
responds to the case of a negative unit weight 



p'(xi - A) 



(xi - X)p(xi - A) 



(4) 



A) 



■A p'(xj - A) 

hi p{ - Xl - A ) 



0, (2) 



Integrating and using the integration function to com- 
plete the square yields the normal pdf. For any other 
constant weight, we would get the weighted average 0, 
pp. 214] 

n 

^2<j i (x l - A) = 0, 
i=l 

with X)"=i a i = 1> rather than the arithmetic mean, 

Actually, Q must be true for whatever i is chosen. 
Hence, it must be independent of i, and, at most, can be 
a function of the location parameter A. Except for the 
normal law, where a is a negative constant, the Poisson 
law, where a = — A -1 , the binomial, and negative, bino- 
mial laws, where <r T = — [A(l T- A)] -1 , respectively, other 
error laws cannot be derived from the conventional form 
of Gauss's principle One example is Student's i-pdf, 
even though it tends to the normal law as the number of 
degrees of freedom increases without limit. The i-pdf is 
nearly normal at the center but it has fatter tails than the 
normal pdf because higher order moments do not exist. 
Another example is Student's r-pdf, notwithstanding the 
fact that all the moments of the distribution exist. 
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We have identified the weights, cr,, as the second 
derivatives of the entropy. The weights we will consider 
here H pp. 210-211], 



it 



(x - \)< 



(5) 



will lead to Pearson distributions of Types II (negative 
sign) and VII (positive sign). We assume that the in- 
formation regarding the values of the location, A, and 
scaling, s, parameters depend on || p. 379] 

x — A 



alone. 

In the case of Type II distributions a single integration 
of <7_ gives the 'equation of state' 



tanh u, 



(6) 



where we will soon appreciate (3 as the intensive variable 
conjugate to the extensive variable u. In essence, © con- 
tains the statement of the second law of thermodynamics 

- = A 

du 

where S(u) is the entropy. A further integration yields 
the entropy reduction 



AS(u) := S(u) - 5(0) 
= — utanh~ u— | log (l — u 2 ) 



(7) 



const. 



where S(0) is the maximum entropy in the 'high temper- 
ature limit' (3 — > 0, where u = 0. 

Introducing the equation of state © into JjJ gives 



AS = —(3 tanh (3 + log cosh (3 + const. 



(8) 



Setting (3 — J/T, where J is the uniform exchange en- 
ergy between spins, and T the absolute temperature in 
energy units where Boltzmann's constant is unity, u in 
is identified as the ratio of the internal energy per 
unit spin to the exchange energy, and © is the zero-field 
entropy reduction per spin of the uniform, linear chain, 
Ising model 6]. Since there are 2 m configurations, the 
maximum entropy per spin is S(0) = log 2. 

For m spins Q will be m times as large. Viewed as 
the Legendre transform 

AS(u)-^-u = logZ(/3), (9) 
the generating function per spin is identified as 

Z(fi) = cosh/3. (10) 
The generating function for m spins is 

Z m {[3) =cosh™- 1 /3, (11) 



which is the hallmark of an infinitely divisible distribu- 
tion 0. 

The Legendre transform © is asymptotically equiva- 
lent to the Laplace transform 



Z m (f3) 



-m(/3«-AS(«)) du 



(12) 



in the limit asm-* oo. The function, (3u — AS(u), has a 
minimum on [—1,0] at the interior point @). Employing 
Laplace's method, the integral l|12f> can be evaluated by 
developing the exponent in a Taylor series to second order 
about the minimum ©, and using the fact that m is so 
large that the value of the resulting Gaussian integral is 
barely changed when the lower limit of integration is set 
equal to — oo. We then obtain 



Z m {(3) 



cosh 



m — 1 







for the expression of the generating function of m spins. 
Consequently, l|10|l and (fill) are logarithmically equiva- 
lent. 

The entropy reduction determines the univariate pdf 
through Boltzmann's principle 0: 

p(u) = Ke AS(u \ (13) 

where if is a constant. Introducing Q) into (|13|l gives 

a — u tanh -1 u l 



p(u) = K- 



{1-U 2 )2 



K (1 - u 2 ) 2 + O (w 4 ) . (14) 



Consider a uniform distribution of mass inside a hyper- 
sphere of m-dimensions with unit radius. The energies 
U\ , . . . , U m of m spins are supposed to be independent 
and identically distributed (iid), and, in addition, they 
have rotationally invariant densities. Such a hypersphere 
will be spanned by the coordinates, ui, . . . , u m . The Eu- 
clidean distance from the origin to any point 

|u| = (u? + .-. + u^)3. 



The section at |u| has radius y/l — u 2 , and an 'area' pro- 
portional to (1 — u 2 )2( m_1 ). Hence, it has a density 

Pm (u) = Ke^ m -^ AS ^ (15) 

~ K (1 - u 2 y {m - 1] + O (u 4 ) . 

The multidimensional pdf, l|15|l . shows that we are es- 
sentially dealing with a beta pdf 

Pm {u) = K(l + ufi^-VQ. - u)^ m ~V + O (« 4 ) , 

defined over [— 1,+1], rather than over [—1,0]. In fact, 
the entropy reduction, J7J) can be written as an entropy 
of mixing 

AS(u) = -i {(1 + u) log(l +«) + (!_ u ) log(l - u)} . 
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The distribution of mass inside a spherical layer whose 
radius varies from (1 — e) to 1, with e an infinitesimal 
quantity is H p. 191] 

Pm(u) = K {(1 - u 2 )^ m ^ - [(l - e f - u 2 ]^" 1 - 1 ^} 

where the constants have been absorbed into the new 
definition of K. This is the same pdf as Ijl5|l . except 
that m has been reduced by a factor of 2. A change of 
variable, u = cos 9, converts it into 

Pm (9) =Ksm m - 2 0. (16) 

For a sphere, m = 3, and it is well-known that two points 
on its surface subtend an angle between 9 and 9 + d9, 
at the center of the sphere, if and only if the second 
point falls in a ring of area 2nsm9d9. This occurs with 
probability 2tt sin 0(19/ in = I sin 9 d9 |To|. 

The approach to the normal pdf can be viewed in the 
following terms. As m increases, jumps to the left and 
right will tend to equalize, like heads and tails, so that the 
distribution will tend to concentrate around the origin. 
To avoid this, and derive the shape of the asymptotic dis- 
tribution, we introduce the scaling u — > u/\fm 8, p. 59] 

so that (l — u 2 /m) 5 ^ m ^ — > e~5 n2 . The normal distri- 
bution is obtained in the limit without having to invoke 
any assumptions regarding the central limit theorem. 

The form of the pdf l|15[) agrees with that found from 
the log-likelihood equation for the centering parameter A 
i p. 210] 

Al g£(A)^ ^ 
Integrating gives the likelihood function 

of a Type II law. Apart from the norming constant, the 
likelihood function is seen to be the product of pdfs of 
the form (TSJ. 

Analytic continuation of the exchange energy, (i.e., 
J — ► i J) converts the thermal equation of state © into 1 

/3 = tan" 1 w, (17) 



1 Feller llll gives a 2-dimensional example where the random equa- 
tion of state, II Yl ■ for u has the density 1221 with m = 2. It 
consists of a light source at the origin on a wall and a rotating 
mirror whose axis of rotation is at a unit distance from the wall. 
The angle of rotation is distributed uniformly between — ^tt and 
^tt, according to 1171 . and u represents the distance between the 
light source and the point where the reflected ray intersects the 
wall. The Cauchy pdf represents the distribution of the inten- 



where j3 — > i(3, and u — > — iu in © 2 . What was an 
inverse temperature has now become an angle, (3. Such 
a process simulates a symmetric walk on an infinite 1- 
dimensional lattice, where what was the dimensionlcss 
internal energy per unit spin, u, is now the position of 
the walker along the chain. Since the interval for f3 is 
[— 57r, |7r], the walker can wander off to ±oo. 

On account of the properties of concavity and convex- 
ity, the equation of state lfTT)l is the negative derivative of 
the entropy and the positive derivative of the logarithm 
of the generating function, respectively. Differentiating a 
second time shows that we are dealing with a Type VII 
distribution, whose weight is given by <r+ in (JSJ). Inte- 
grating (|17|l gives the entropy reduction 

AS(u) = -utan" 1 u + | log (l + u 2 ) + const. (18) 

For small u, the entropy reduction (|18fl reduces to the 
negative quadratic form of the normal law, 

AS{u) ~ -1 log (1 + u 2 ) -> -\u 2 . 

Introducing Ijl7(l into (|18(l enables the comparison of 

AS = -(3 tan /3 - log cos /3 + const., (19) 

and (JSJ . Under (3 — > i/3, one becomes the negative of the 
other. On the strength of the Legendre transform, 0, 
the logarithm of the generating function can be identified 
from (fT§|l as 

logZ(/3) = -log cos /3. (20) 

The characteristic function of a random walk model 
with equal a priori probabilities of a jump to the left 
or to the right is cos/3 3 . This symmetric Bernoullian 



sity of light along lines of constant i>, the axis orthogonal to u 
in the u, D-plane. The convolution property of the Cauchy pdf, 
according to Feller, is a statement of Hughen's principle which 
states that intensity of light is the same for a source located at 
the origin along the line v = 1, as it would be if the source 
were distributed along this line and the intensity were measured 
along v = 2. Moreover, if we make n measurements, the average, 
(Ui + U2 + . ■ ■ + U n )/n, has exactly the same pdf as that of a sin- 
gle measurement, so that nothing is gained by making repeated 
measurements. 

2 A similar transformation leads from aperiodic to periodic be- 
havior in Newton's reiteration scheme. An approximate root 
x n of the equation g(x) = has £ n +i = x n — g(x n )/g' (x„) 
as its next approximation. For g(x) = x 2 — 1, this reads 
x n+1 = \{x n + l/x n ). Setting x n = coth0 n gives 9 n+1 = 20 n , 
without any trace of periodicity. However, by making x purely 
imaginary, x = iy, the iterative scheme y n +i = \ (yn — 1/j/n) has 
the solution y = — cot(7n?), which demands that i? n +i = 2i? n 
mod 1. Now, initial aperiodic behavior can lead to preperiodic 
orbits by an appropriate cho ice of a rational number for i} n , in- 
stead of to a fixed point \12\. 

3 The generating function exists for all real f3 in all cases where 
the pdf has a finite range, and in the case of the normal law 0, 
p. 87]. In cases where the integral exists, the logarithm of the 
generating function will be equal in magnitude and opposite in 
sign to the logarithm of the characteristic function. 
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distribution has mass +| at sites +1 and —1. Since the 
jumps are iid, the characteristic function of m jumps is 
simply cos m [3. The inverse Fourier transform 



By integration we find 



p(u) 



m r 9 7T 



7T . /I 

2 W 



1 

2 m V |(m + it) 



-u 2 /2r, 



gives the pdf, which in the large TO-limit, transforms into 
the normal pdf. The approach to the normal pdf can be 
obtained from Boltzmann's principle, 1131) . 

With the entropy reduction given by (jl8|) . the pdf for a 
single jump of length u is given by Boltzmann's principle, 

P (u) = k (i + u 2 y e - utan ~ lu . 

Expanding the exponent, we get 

p(u) = K(l + u 2 yh +0 (u 4 ) . (21) 
If the steps, Ui, U%, . . . , U m , are iid, then the length, 

U=||tf|| = (tt? + --- + U^, 

will have an m-dimensional pdf of the form 4 

Pm (u) =K(l + u 2 y^ n + 0{u 4 ). (22) 

The change of variable, u — cot will transform IL'21) into 
G3|p. 193]. 

The form of the pdf (|2*2*)l is, again, supported by the 
log-likelihood equation for the centering parameter A 0, 
p. 210] 

d log£(A) = £ r Xi ~ X 



dX 



^ l + (xi- \) 2 /m 



4 The distinction between 1221 and the Cauchy distribution in 5R m 
is the following. For m = 2, the probability that a vector in a 
randomly chosen direction with a length R will be greater than 
1 . . 

r is Pr(i? > r) = (1 + r 2 ) 2 [n], p. 71]. The extreme obser- 
vations that m — 1 will be greater than r while the remaining 
one will lie in the range dr will have a density proportional to 

r(l + r 2 ) _ 2^ m+1 \ The change of variable, r = ^fx will con- 
vert it into a beta density of the second kind, proportional to 

(1 + x) — ' 2 m+1 '. Likewise, the property of extreme observations 

for Pr(R > r) = (1 — r 2 )2 will lead to a beta density of the 
first kind, which is closely allied to order statistics. For extreme 
observations of largest value, Boltzmann's principle 1131 must 
be modified in such a way that the logarithm of the tail of the 
distribution is proportional to the entropy reduction ll.'J . 



AA)=n i+ 



{Xi - \f 



rn 



for the likelihood function of a Type VII law. To within 
a constant factor, the likelihood function is a product of 
pdfs of the form (J22| . 

The pdf l|22l) is a generalization of the 1-dimensional 
Student i-pdf with m degrees of freedom to an Tri- 
dimensional distribution having the same number of de- 
grees of freedom p, p. 194]. In the limit as m — > oo, l|22|) 
transforms into the normal pdf, 



p{u) 



2tt 



(23) 



where the norming constant K 



lim 



{ m -^ jB (i,i( TO _l ) )]- 1 }^(2 7r )- 



and the scaling u — > u/y/m has been introduced. Stu- 
dent's ^distribution (|22ll can, therefore, rightly be con- 
sidered the last stop before normality for large, but finite, 
rn . 

We have derived Student's t-pdf from Boltzmann's 
principle, l|13|) . where the entropy l|18fl is independent of 
the number of degrees of freedom. As a consequence 
of their statistical independence, the entropy is additive. 
For m idd random variables, or degrees of freedom, the 
total entropy is m times as great as the entropy of a sin- 
gle variate, while the joint pdf is the product of functions, 
and this can only be true if there is a logarithmic relation 
between the two. If this were not the case, it would be im- 
possible to talk about the entropy of a single constituent 
of the system, or the probability of a single event. Conse- 
quently, any putative expression for the entropy which is 
defined in terms of the degrees of freedom of the system 
|l4l ] must be considered not as an entropy, but, rather, 
as an interpolation formula between two bona fide ex- 
pressions of the entropy. A prototype is the Renyi en- 
tropy, which is an interpolation formula connecting the 
Boltzmann-Hartley and the Gibbs-Shannon entropies as 
a parameter ranges over its permitted values (i.e., those 
that preserve the property of Schur-concavity 
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